Fruit Carts: A Domain and Corpus for Research in Dialogue Systems and Psycholinguistics

نویسندگان

  • Gregory Aist
  • Ellen Campana
  • James F. Allen
  • Mary D. Swift
  • Michael K. Tanenhaus
چکیده

We describe a novel domain, Fruit Carts, aimed at eliciting human language production for the twin purposes of (a) dialogue system research and development and (b) psycholinguistic research. Fruit Carts contains five tasks: choosing a cart, placing it on a map, painting the cart, rotating the cart, and filling the cart with fruit. Fruit Carts has been used for research in psycholinguistics and in dialogue systems. Based on these experiences, we discuss how well the Fruit Carts domain meets four desired features: unscripted, context-constrained, controllable difficulty, and separability into semi-independent subdialogues. We describe the domain in sufficient detail to allow others to replicate it; researchers interested in using the corpora themselves are encouraged to contact the authors directly.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Modelling Multi-issue Bargaining Dialogues: Data Collection, Annotation Design and Corpus

The paper describes experimental dialogue data collection activities, as well semantically annotated corpus creation undertaken within EU-funded METALOGUE project. The project aims to develop a dialogue system with flexible dialogue management to enable systems adaptive, reactive, interactive and proactive dialogue behaviour in setting goals, choosing appropriate strategies and monitoring numer...

متن کامل

Towards a psycholinguistics of dialogue: defining reaction time and error rate in a dialogue corpus

This study uses the multi-level coding of a designed corpus of unscripted task-oriented dialogues to demonstrate that time to respond (Inter-Move Interval, IMI) and rate of disfluency behave like psycholinguistic measures, reaction time and error rate, in reflecting the speakers’ cognitive burdens. Multiple-regression analyses show that IMI is sensitive to social distance between interlocutors,...

متن کامل

A Preliminary Investigation of Hierarchical Hidden Markov Models for Tutorial Planning

For tutorial dialogue systems, selecting an appropriate dialogue move to support learners can significantly influence cognitive and affective outcomes. The strategies implemented in tutorial dialogue systems have historically been based on handcrafted rules derived from observing human tutors, but a data-driven model of strategy selection may increase the effectiveness of tutorial dialogue syst...

متن کامل

Corpus based coreference resolution for Farsi text

"Coreference resolution" or "finding all expressions that refer to the same entity" in a text, is one of the important requirements in natural language processing. Two words are coreference when both refer to a single entity in the text or the real world. So the main task of coreference resolution systems is to identify terms that refer to a unique entity. A coreference resolution tool could be...

متن کامل

Coherence and Structure in Text and Discourse

Textual coherence versus discourse structure Coherence is one of the most general and most widely discussed concepts in the study of text and discourse In spite or perhaps because of its central status the concept of coherence has many di erent and often incompatible de nitions and connotations For text linguistics or psycholinguistics with their focus on the representation and processing of in...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Computational linguistics

دوره 38 3  شماره 

صفحات  -

تاریخ انتشار 2012